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We illustrate how the Matrix Element Method at Next-to-Leading Order (MEMONLO) can be 
used to discriminate between events arising from the production of a Higgs boson, which subse- 
quently decays to a final state consisting of £ + £~y, and the background production of the same final 
state. We illustrate how the method could be used in an experimental analysis by devising cuts on 
the signal (Ps) and background (Pb) weights that are computed event-by-event in this approach. 
We find that we can increase the S/ VB ratio by around 50% compared to an invariant mass fit on 
its own. Considering only statistical uncertainty, this is equivalent to recording a factor of around 
two times more integrated luminosity. 
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I. INTRODUCTION 

The recent discovery of a new boson with properties 
consistent with that of the Standard Model (SM) Higgs 
boson [TJ [5] , has indicated that the discovery of the elec- 
troweak (EW) symmetry breaking mechanism may be 
at hand. In order to confirm whether the new boson is 
indeed the SM Higgs, it is crucial to measure both its 
properties and branching ratios for the largest number 
of experimentally viable decay channels. These analy- 
ses could result in tension with the SM Higgs prediction, 
for instance the boson may differ in parity from the SM 
Higgs or even be a mixture of CP odd and even states. 
An additional possibility is that the rate for one or more 
measured decay channels is different from the SM predic- 
tion. The most obvious mechanism for such a scenario is 
an enhancement /suppression in loop-induced decays that 
are naturally sensitive to couplings to new virtual parti- 
cles, for instance the decay to two photons (77). This 
would thus be evidence for Beyond the Standard Model 
(BSM) physics. 

Another loop-induced Higgs decay is the decay to a 
final state containing a Z boson and a photon (H — > 
Z"/) [31 2] . Since it is a loop-induced process the branch- 
ing ratio is small and the decay of the Z boson to well- 
measured final state particles (p + ji~ or e + e~) means 
that the decay H — > £ + £~^ is a very rare SM process. 
However, this is not necessarily the case in extensions of 
the SM. In addition, the ratio of 77 to Z7 branching ra- 
tios can be used to discriminate between certain models 
of new physics [5HZ]. 

Observing the Higgs in the Zj final state is a difficult 
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feat. Firstly, the background production of Z"f is several 
orders of magnitude larger than the Higgs induced rate. 
The exact value of this ratio depends on the cuts defining 
the background cross section. In the region of invariant 
mass near the Higgs mass (m« 7 ^125 GeV), with typi- 
cal LHC cuts one would usually expect around 500-1000 
background events for each signal event. Secondly, the 
kinematics of the decay limit the final state photon to 
a challenging region of phase space. At Leading-Order 
(LO) the maximum transverse momentum {pr) of the 
final-state photon is restricted, since the final-state in- 
variant mass is close to m# and includes a pair of charged 
leptons of mass close to the Z mass [vfiz)- Since mz is 
not too far from run, the remaining energy to be im- 
parted to the photon lies in a limited range. Therefore 
the pt spectrum of the photon peaks around 30 GeV. 

Typically in such a soft region of phase space, QCD 
can provide large backgrounds to searches. This increases 
the difficulty in separating signal from background when 
compared with H — > 77, for which the photon px from 
the signal is significantly harder (p T ~ 60 GeV). Indeed, 
once the full detector simulation has been included, there 
are only small differences between the signal and back- 
ground shapes in the transverse variables [8] . In terms of 
final state kinematics, the main discriminating variable 
is the angle between the direction of the photon and the 
beam [§1[TU]. A spin-0 scalar is isotropic in this variable, 
whilst the background matrix element prefers emission 
in the forward region. A recent CMS study with around 
10 ftr 1 of 7 and 8 TeV data [5], set a limit around 10 
times the SM cross section. This result already disfa- 
vors scenarios in which the new boson is a pure pseudo- 
scalar since in some of these models the branching ratio 
to Z7 can be enhanced by up to 170 times the SM pre- 
diction [TT] . 

An experimental search for H — > Z7 should thus uti- 
lize as much theoretical information as possible in or- 
der to effectively reduce the unwanted background Zj 
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events. One such method is the Matrix Element Method 
(MEM) [12H16J . This method uses the matrix element 
associated with a given theoretical hypothesis to assign 
a probabilistic weight to an experimental event. Com- 
paring weights obtained by varying the theoretical hy- 
pothesis allows one to identify the most favorable one. 
Originally the method was used in order to perform a 
measurement of a known theoretical parameter, say for 
instance the top mass [TTrt5U] . More recently, the method 
has gained favor as an event by event discriminant [9] 1211 — 
125] . By using the matrix element one naturally includes 
all of the kinematic correlations present in the observed 
final state, and thus gains a large amount of theoreti- 
cal information. Until recently a major drawback of the 
MEM was its restriction to LO matrix elements. However 
in ref. [25] a new version of the MEM was proposed that 
can be extended to higher orders in perturbation theory 1 . 
Using the NLO method provides a much greater degree 
of theoretical reliability and control over the theoretical 
systematics. 

Recently the MEM has been used in searches for, and 
studies of, the Higgs boson. The MEM at LO has been 
applied to Higgs searches in the ZZ [5T] and Z^j [5] chan- 
nels. The MEM has also recently been applied to study 
the properties of the Higgs (decaying to two photons) 
via vector boson fusion [55] and to investigate its role 
in unitarizing WW scattering [53]. The Z7 search [5] 
used an implementation of the MEM that is restricted 
to LO, and considered the decay in an effective field 
theory approach. The authors found only a marginal 
improvement between the MEM and a simpler approach 
that only used m« 7 as the discriminant. With the recent 
CMS study to guide us [8j , the aim of this paper is to re- 
investigate the channel using the MEM@NLO algorithm 
and the full loop matrix element for the decay. 

This paper proceeds as follows. In section [TT] we discuss 
the H — > Zj calculation and the form of the matrix 
elements used in our analysis. Section [ITT] provides a brief 
overview of the MEM@NLO technique and we present 
our results in section IV Finally in section [V] we draw 
our conclusions. 



II. THE HIGGS DECAY TO Z-y 

In this section we briefly discuss the calculation of H — >• 
Z7 as it is implemented in the code MCFM [271 [28, ■ The 
H — > Z"f decay was first considered over twenty years 
ago [31 H]- We consider the process, 
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where the momenta are shown in parentheses. The 
squared matrix element for this loop-induced process has 



the following form 
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In this expression we have introduced the electroweak 
coupling e and the weak mixing angle 6\y and kinematic 
invariants are defined through s,y = (pi + pj) 2 . The left- 
handed and right-handed amplitudes are defined by, 
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in terms of the charge of the leptons and top quarks {Qt, 
Qt) the number of colours (N c ) and the top mass (mt). 
The vector couplings are defined as, 
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Finally the function Vz describes the Z propagator (with 
width Tz), 
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The loop integral functions are contained in Fyy and Ft, 
which are defined by, 
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and, 
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In these expressions Co and C 2 are standard Passarino- 
Veltman tensor integrals. Further details can be found 
in Refs. [MSJ. 

In addition to the decay described above, MCFM con- 
tains NLO Higgs and QCD Zj production, including the 
gg — > Zj loop induced process [55]. We will use these 
matrix elements to calculate our weights. 



III. THE MEMONLO TECHNIQUE 



1 With the caveat that the final state of interest should consist of 
only EW particles 



This section provides a brief overview of the MEM 
technique developed in Ref. [26 , to which we refer the 
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interested reader for a more complete discussion. The 
crux of the MEM method is to provide an event-by-event 
weight using the matrix element. At LO one defines each 
event to be weighted by the following quantity 

Plo{4>) — —— [ dx l dx 2 d(/>S(x 1 x 2 s - Q 2 ) 
olo J 

xf(x 1 )f i {x 2 )B ij (x 1 ,x 2 ,<P)W((l>,^). (9) 

Here f(x) represent the PDF with momentum fraction 
x, Q 2 is the invariant mass of the EW final state, 
represents the LO matrix element, which depends on the 
final state phase space point </>, which is derived from 
the input event from data (</>), via the transfer function 
W(4>,4>). The weights are normalized by the LO cross 
section (Tlo- Often in this paper we will use the following 
weight, which is defined in the limit of a perfect detector 
setup, i.e W(4>, <j)) = 8{<p — </>), 

Plo{4>) = —— / dx\ dx 2 S(x 1 x 2 s - Q 2 ) 
clo J 

xf(xi)f(^)B ij (x 1 ,x 2 ,4>). (10) 

This weight has the advantage of requiring fewer Monte 
Carlo integrations, and hence it is less computationally 
expensive. However, one must be confident that the 
analysis is not sensitive to such a simplifying assump- 
tion. For instance, in this study the narrow width of the 
Higgs would spoil this assumption, since any event with 
mu-y = trh would result in a large weight compared to 



where V represents the contributions from the Born ma- 
trix element, the interference between the Born and one- 
loop amplitudes and the integrated form of a relevant 
subtraction term (in this work we use a slightly modified 



In the above IZij represents the matrix element for the 
Born amplitude plus one additional parton, rendered fi- 
nite by the corresponding subtraction terms. Note that 
the constraining delta function for the PDF's has changed 
definition with respect to LO, Q 2 fbps 1S the invariant 
mass of the Born final state plus the NLO emission. For 
full details of the FBPS and subtraction setup we refer 



the remaining events in the sample, yielding unrealistic 
results. Therefore in order to use the above definition, 
care must be taken with variables (in this case the in- 
variant mass) that are extremely sensitive to detector 
resolution. We shall discuss this further in the next sec- 
tion. 

The observed EW final state typically recoils against 
hadronic activity that is not modeled in the leading order 
calculation. In order for the weights to be well-defined 
and unique one must therefore perform a boost to ensure 
that the final state <f> is balanced in pt- Then one can ap- 
ply the PDF weighting assuming two beams colliding in 
the z— direction. Since there are multiple Lorentz trans- 
formations satisfying these requirements which are con- 
nected by longitudinal boosts to each other, we integrate 
over the allowed range. This results in the correspond- 
ing integration over x\ (or x 2 ) in eq. ([9]). We refer to 
the set of pr balanced final state frames collectively as 
the MEM frame. We note that failure to perform this 
boost and subsequent integration results in a either an 
ill-defined (no-boost) or non-unique (no integration over 
boosts), and hence theoretically unreliable weight. 

The MEM frame allows calculation of weights accurate 
to NLO defined as [26], 

Pnlo(4>) = f d<f>(V(<P) + R{<t>))W{ct>, 0). (11) 

&NLO J 

The virtual corrections are expressed as, 
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Catani-Seymour [32] dipole approach). The real radia- 
tion pieces involve integration over an unresolved emis- 
sion for which we use the forward-branching phase space 
(FBPS) generator described in ref [33j [34] 



(13) 
I 

the reader to ref. |26) . 

The main difference between the LO and NLO MEM 
is the integration over the real phase space. Some events 
in the lab frame, when mapped to the MEM frame, no 
longer lie in the fiducial region defined by the lab frame 
cuts (which the weights are normalized to) and there- 
fore are assigned zero weight [2B]. At NLO these events 
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can have non-zero weights since the real emission con- 
tributions can boost these events back into the fiducial 
region. A simple way of interpreting this phenomenon is 
that NLO covers a larger kinematic phase space than LO. 
This larger phase space manifests itself as an ability to 
accept events which do not possess the correct kinematics 
to have arisen from our LO discriminant. This is one of 
the primary advantages of the MEM@NLO method com- 
pared to the MEM@LO (in addition to the usual increase 
in confidence in the understanding of the theoretical sys- 
tematic error from using an NLO prediction). Typically 
one finds that 0(as) (i-e. of order 10%) of the events 
fail the LO cuts. 

Each event in the data set can now be assigned a 
unique LO or NLO weight associated with a theoretical 
hypothesis controlled by the underlying matrix element. 
In our case we will assign it a weight based on the signal 
matrix element p H ~> z ~f r the irreducible background 
production of Zj, P Zl . One can use these individual 
quantities to build discriminants. Unless stated other- 
wise, our default is to use NLO matrix elements in our 
weight calculations. 



IV. RESULTS 

In order to study the MEM@NLO for the H ->• Z7 de- 
cay mode we generate samples of signal and background 
events. We do this using the SHERPA event genera- 
tor [35]. For the background we generate a CKKW [36] 
matched sample of Z7 events. For the signal events we 
generate NLO matched Higgs events for m# = 125 GeV. 
These Higgs events are then subsequently decayed to the 
Z^f final state using the MCFM implementation that is 
described in Section [IT] Since the Higgs is a scalar parti- 
cle, production and decay are uncorrelated. This allows 



us to simply calculate the decay using MCFM in the rest 
frame of the Higgs and then boost it so that it has the 
four-momentum of the SHERPA event. Throughout our 
studies we will use the CT10 PDF set [37] , 

The above procedure produces events at the particle 
level. However, in order to study the light Higgs in a 
meaningful way one must include some kind of detector 
simulation. This is because the light Higgs has such a 
narrow width that the m« 7 spectrum is dominated by 
the detector resolution. For example, the CMS technical 
design report |38| estimates a resolution of photon energy 
using the following, 



3.6% 



18.5% 



\ E 1 J ^/KjGeV £ 7 /GeV 



© 0.66% (14) 



where A_E 7 represents the width of the Gaussian smear- 
ing and © indicates that the quantities are to be added 
in quadrature. At E — 30 GeV this provides a width 
of around 0.3 GeV. Using this smearing (and the equiv- 
alent leptonic quantity) we find a Higgs boson lineshape 
which is too narrow compared to that recently reported 
by CMS [8] , where the effective Gaussian width quoted is 
around 3 — 4 GeV. Therefore in order to match onto the 
results of this paper we inflate our Gaussian smearing to, 



AE 7 = 2 GeV, 



AE f = 0.5 GeV. 



(15) 



Our resulting lineshape for the Higgs is now in good 
agreement with ref. [8]. Note that, since our enhanced 
width is around a factor of six greater than that arising 
from the energy dependent piece eq. ( 14 ) , we drop the 
energy dependence for simplicity. 

After smearing our events as described above we apply 
the following lab frame cuts, 



pl>15GeV, |r? 7 |<2, p^>20GeV, \rj i \<2, 60 < m u < 120 GeV, 115 < m« 7 < 135 GeV, R tj > 0.7 . (16) 



Note that we have kept the cuts on the lepton and 
photon pt loose. Part of the attraction of the MEM 
discriminant is that it will naturally select events which 
have the correct kinematics to be signal events, therefore 
one does not have to spend time attempting to optimize 
kinematic selection criteria. Of course, if some observable 
clearly discriminates signal from background, cuts on this 
quantity should be applied in order to reduce the overall 
computational load. For this reason we impose a tight 
cut on the invariant mass of the Zj system, centered on 
125 GeV. 



A. Generation of reducible background events. 

In order to effectively simulate LHC conditions we 
must also consider events which do not arise from the 
irreducible Z7 background, but instead are misidentified 
in the detector. Since they are naturally very dependent 
on the exact detector setup and modeling, such events 
are difficult to accurately simulate in our study. However 
since they are a large fraction of the resulting event sam- 
ple [5], it is necessary to attempt to provide an estimate 
of our discriminant on a "fake" sample. To this end we 
generate a sample of fake events in the following way. We 
assume that the dominant component of the fake events 
results from Z+jet events in which the leptons from the Z 
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decay are clean, but the jet is mismeasured as a photon. 
As a crude model, we use SHERPA to generate Z+jet 
events and then smear the pr and r] of the (leading) jet 
by Gaussian functions with a width of 10 GeV and 0.5 
respectively. Our event sample is then generated by ap- 
plying the cuts described above in the previous section to 
the smeared events, treating the smeared jet as a photon. 



B. The MEM@NLO as a kinematic discriminant 

We first discuss our definition of the Higgs signal hy- 
pothesis, which is particularly important because of the 
very narrow SM width. One approach is to define a 
weight for each event by integrating over transfer func- 
tions that model the detector resolution, as in eq. (9j. 
This approach allows one to test a single Higgs mass hy- 
pothesis for a given set of events, but requires additional 
integrations per event. An alternative approach is to 
change the Higgs mass hypothesis on an event-by-event 
basis by choosing mu — ™« 7 - In this scenario one ef- 
fectively changes the propagator in the matrix element 
to, 
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(s - m H ) 2 + iT H m H iT H m Uj 



(17) 



Since Tjj is very small this approach makes each event 
have a large P$ and the collection of events can no 
longer define a probability density function. In addition 
Ps > Pb even for background event samples. However, 
one still expects discriminating power between signal and 
background since Ps arising from the events which match 
onto the signal hypothesis will be larger than that for 
Ps from the background. Finally we note that the (sig- 
nal) normalization is defined uniquely for each event by 
a(m H = m Ul ). 

This technique has been used extensively in studies 
involving kinematic discriminants in H — > ZZ — > Al 
(2~4l, [25] , The advantage of this technique is that there 
are less integrations per event and thus the weights are 
computationally less expensive. One can then restore 
the invariant mass m« 7 as an additional discriminant 
since Higgs events will cluster in invariant mass whilst 
the background will be more diffuse. We will adopt this 
approach for the remainder of this paper. 

With the event samples generated as described in the 
previous sections we can now introduce our discriminant 
T>. There is a range of possibilities but in this paper we 
will choose, 



V = - log 



Pb 



Ps 



(18) 



Events which arise from background should have larger 
Pb than Ps so the ratio in the logarithm is near one. 
As a result, events with T> nearer zero should be more 
background-like than signal. 



We present results for V for our three different event 
sample classes in Fig. [TJ The results are shown as two- 
dimensional histograms, binned both by the discriminant 
T> and the invariant mass, mu-y- In addition, we also 
show one-dimensional projections of these histograms, as 
a function of T> only. As expected the signal events peak 
at larger T> than the corresponding background distribu- 
tion. The background and fake samples have roughly 
similar shapes (indicating some of the similarities be- 
tween Z+jet and Zj). Although the signal shapes are 
similar to the background, there are still significant re- 
gions that are only populated by background events (but 
which may still have an invariant mass in the Higgs win- 
dow). In particular, both the background and fake sam- 
ples have significant tails in the lower D region, whereas 
the signal sample does not. For example, there are barely 
any ( ~ 0.5%) of signal events with T> < 6. On the other 
hand around 10% of the background events lie in this 
region. Cutting at T> > 6 would thus be an almost zero- 
cost reduction in signal at the expense of a non-trivial 
background number of events. 

The scales on the two-dimensional histograms illus- 
trate the stark differences between the signal and back- 
ground events in the (2?,m« 7 ) plane. The area of high- 
est density for the signal events (around the truth value, 
mu-y = 125 GeV) is around three times greater than the 
corresponding highest density region for the background 
(which is at much lower invariant mass). Retaining only 
the events that satisfy T> > 7, one rejects 21% of the irre- 
ducible background events and keeps 93% of the signal. 
A higher cut, T> > 8 rejects 64% of the background and 
retains 55% of the signal. In an experimental analysis 
one would thus choose the optimal value of T> at which 
to cut in order to optimize the signal to background ra- 
tio. Since our model of the fakes is less developed than 
our signal and background models we optimize our cut 
on the discriminant on the combination of signal and ir- 
reducible background samples. We find a value of the 
cut at V > 7.5 corresponds to a signal efficiency of 81%, 
with an background rejection of 37%. We note that here 
we have chosen a fairly simple cut on T> that optimizes 
S/ \f~B. One could instead perform counting experiments 
using contours in the (2?,m« 7 ) plane although such a 
study is beyond the scope of this work. 

We plot the invariant mass m« 7 before and after our 
cut (V > 7.5) for our three samples in Fig. [2j Each 
sample is weighted to reflect the number of events ex- 
pected, given the total number of irreducible background 
Zj events. We weight our signal sample by the ratio of 
cross sections (including a NLO to NNLO if-factor of 
1.2 [SH]). We normalise our fake sample to be compati- 
ble with that reported by CMS [8] , namely by fixing the 
number of fake events to be one third of the irreducible 
background. Our cuts have altered the shape of the back- 
ground and fake samples, whilst maintaining the overall 
shape of the signal. 

Ultimately we would like to investigate the efficiency of 
this method in the vicinity of the Higgs signal at wih — 
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Figure 1: Distribution of events in terms of the invariant mass of the final state, m« 7 and our discriminant T> defined in 
eq. (18 1. The two-dimensional histograms (left) present the density of events in the plane of T> and muf The right hand 
panels represent the distribution of T> for our signal (top row), background (middle row) and fake (bottom row) samples. Each 
sample is normalized to the total number of events in the sample. 
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Figure 2: Invariant mass distributions (for mu-y) before (left panels) and after (right panels) our analysis cut T> > 7.5. 
Distributions are shown for the signal events (top row), the irreducible background (middle row) and fakes (final row). The 
number of events in each distribution is normalized to the irreducible background sample without application of any cuts on 
D, as described in the text. The red curve indicates a Gaussian (polynomial) fit to the signal (background) data. 



125 GeV. We therefore define a window, 
122 < m Ul < 128 GeV , 



(19) 



where the width has been optimized for the analysis be- 
low. We attempt to quantify the improvement the cut on 
T> has made in the following way. We define the quantity, 



y/N. 



Zy "I" Nf akes 



(20) 



where Nx represents the expected number of events for 
process X. Our measure includes no treatment of system- 
atic errors and instead only assumes the S/y/B scaling 
of the statistical uncertainty. In spite of its shortcomings 
compared to the full experimental analysis, a can pro- 
vide us with an estimate of the improvement one might 
envisage after applying our cut. We find, 



£*D>7.5 



= 1.52. 



(21) 



Since a scales as £" 1 / 2 , using a cut of D > 7.5 is (sta- 
tistically) equivalent to taking 2.31 times more data. 

Before concluding, we will briefly consider the impact 
of using the leading order method, MEM@LO rather 
than the NLO one. We find that the fraction of events 
which fail the fiducial cuts at LO is larger for the Higgs 
signal than for the irreducible background. As a re- 
sult the MEM@NLO produces slightly better signal over 
background ratios than the MEM@LO. For example we 
find. 



"£>>() 
LO 

*D>7.5 



1.41, 



(22) 



which is 7% smaller than the corresponding NLO value. 
This small difference, between the LO and NLO anal- 
yses, indicates that the method is perturbatively stable 
and the theoretical systematic uncertainty is under good 
control. 



V. CONCLUSIONS 

In this paper we have presented an application of the 
MEM@NLO to searches for the Higgs boson in the decay 
channel Z^j. This channel is extremely challenging ex- 
perimentally as can be seen from the preliminary results 
from CMS [8]. The reasons for these difficulties are two- 
fold. Firstly the H — > Z-f branching ratio is already very 
small, even before the requirement that the Z-boson de- 
cays to muons and electrons only. Since the background 
production of Z in association with a photon is large, one 
naturally has low signal to background ratios. Secondly 
the kinematics of the decay for a Higgs boson with mass 
rriH ~ 125 GeV force the final state photon to have a 
relatively soft px- The matrix element has a soft singu- 
larity as p7p — > and therefore the background is very 



large in the region in which the Higgs signal peaks. Once 
detector effects are included there is very little difference 
between the signal and background events in the trans- 
verse variables. 

Given these difficulties, it is essential to utilize all the 
remaining differences between the signal and background 
processes. One approach, the matrix element method, 
uses a theoretically defined matrix element to assign a 
weight to each experimental event. When there is a good 
match between the theoretical hypothesis and the input 
events the weights become larger. Therefore one can use 
the MEM to produce samples of events which increase 
the signal to background ratio for a certain theoreti- 
cal hypothesis. Recently the MEM has been extended 
to NLO for electroweak final states [55]. We used this 
MEM@NLO to calculate signal and background discrim- 
inants for a sample of events generated using SHERPA. 
Our event sample included showered and hadronized 
Higgs signal and SM background events as well as a crude 
model of Z+]et fake events. Higgs decays to Zj were gen- 
erated using the MCFM implementation. Our estimates 
of resolution effects and fake rates were guided by the 
recent results from CMS presented in Ref. [5]. 

We used the MEM@NLO to construct a discriminant 
(D) from the event-by-event weights Ps (using the sig- 
nal matrix element) and Pb (the background matrix el- 
ement). In defining these weights we removed the in- 
variant mass as a discriminating variable. As a result 
we were subsequently able to create a two-dimensional 
discriminant in T> and m« 7 . In this plane the signal 
events cluster around mn and at higher T> compared to 
those arising from the background and fakes. Therefore, 
by cutting on T> and m« 7 , we were able to improve our 
measure of the signal significance, S/y/B. We found that 
S/y/B increased by around a factor of 1.5 compared to 
the value obtained without any cut on T>, suggesting that 
roughly half as much data would be needed to obtain the 
same limit on H ->■ Z7. We found that the MEM@LO 
algorithm is also able to provide S/y/B improvements by 
a factor of around 1.4, approximately 10% less efficient 
than the MEMONLO. 

This search has provided an example of the power of 
the matrix element method in a worst case scenario for 
a traditional analysis. We hope that the ideas presented 
in this paper are useful to our experimental colleagues 
in the hunt for the Higgs boson in this difficult channel. 
Code which calculates the weights described in this paper 
is available upon request. 
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